Smart Indexes for Efficient Browsing of Library Collections

نویسندگان

  • Steven Geffner
  • Divyakant Agrawal
  • Amr El Abbadi
  • Terence R. Smith
  • Mary Larsgaard
چکیده

To enable efficient browsing and interactive querying of very large collections, such as those found in digital libraries, it is essential to provide users with summaries of query result sets. Smart indexes can be used to generate summary statistics, aggregated classification information, and/or aggregated contentbased information for the result sets of arbitrary queries. We present the basic model of a smart index, as well as variations of smart indexes that are suitable when the size of summaries is large. An algorithm for generating summaries of the results of arbitrary queries is given, and algorithms for updating various summaries are discussed. Experimental results show that smart indexes generate summaries much more efficiently than traditional trees for all query areas greater than 1%-2% of the data space, with a relatively small additional storage overhead. Contrary to traditional trees, smart indexes in general perform better as the query area grows larger.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Virtual Digital Library

DIGMAP is a digital library specialized in searching and browsing services for old maps and related resources. The service reuses metadata from national libraries and other relevant third party metadata sources, providing added value services by aggregating all the data in comprehensive collections, browsing indexes and search functions. The services are based in a set of specialized tools, com...

متن کامل

Arthistorian: an Integrated Indexing and Personalized Browsing System for Art Paintings

This paper introduces ArtHistorian, an art painting indexing system designed for automatic generation of dynamic painting presentations. The developed system aims personalized content browsing with a classification based indexing and query method implemented from the art historians’ perspective. ArtHistorian represents the visual content of paintings by a 6-D feature vector that is robust to sc...

متن کامل

Optimised Phrase Querying and Browsing of Large Text Databases

Most search systems for querying large document collections—for example, web search engines—are based on well-understood information retrieval principles. These systems are both efficient and effective in finding answers to many user information needs, expressed through informal ranked or structured Boolean queries. Phrase querying and browsing are additional techniques that can augment or repl...

متن کامل

Browsing and book selection in the physical library shelves

Library users should be conveniently interact with collections and be able to easily choose books of interest as they explore and browse a physical book collection. While there exists a growing body of naturalistic studies of browsing and book selection in digital collections, the corresponding literature on behaviour in the physical stacks is surprisingly sparse. We add to this literature in t...

متن کامل

Searching and Browsing Collections of Structural Information

This paper proposes a new approach to querying collections of structured textual information such as SGML/XML documents. Knowledge about the structure of documents is an additional resource that should be exploited during retrieval since the semantics of the different textual objects can be used to specify an information need much more precisely. However, the traditional probabilistic retrieval...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998